Distributed Planning for Large Teams
نویسنده
چکیده
In many domains, teams of hundreds of agents must cooperatively plan to perform tasks in a complex, uncertain environment. Naively, this requires that each agent take into account every teammates’ state, observation, and choice of action when making decisions about its own actions. This results in a huge joint policy space over which it is computationally intractable to find solutions. In certain problems, however, searching this complete space may not be necessary. Specifically, there are problems in which individual agents usually act independently, but have a few combinations of states and actions in which they share a non-factorable transition, reward, or observation function with one or more teammates. This thesis focuses on exploiting this structure, along with two other properties that are often present in these cases, to greatly improve planning efficiency. First, while there are a large number of possible interactions between agents, the number of interactions that actually occur in a particular solution instance is often quite small. It is therefore possible to disregard many irrelevant combinations of interactions by dynamically handling only those that arise during the planning process. Second, in the case of intelligent agents, computational power itself is often distributed across the team. Thus, distributed approaches have access to computational resources that grow linearly with team size, making it easier to scale to very large teams. Taking advantage of these properties, we propose DIMS, a framework in which agents plan iteratively and concurrently over independent local models which are then shaped by the expected observations, movements and rewards of their teammates. By dynamically discovering relevant interactions and distributing computation, planning efficiency is greatly improved, allowing joint solutions to be computed for teams into the hundreds of agents. Initial experiments have been conducted in a simplified urban search and rescue domain. These experiments demonstrate the promise of the approach. To complete the thesis work, we will extend these experiments to urban search and rescue planning and a humanitarian convoy planning task. In each domain, a low-fidelity, large scale model and a high-fidelity, real-time physical simulation model are constructed. Empirical results over these four conditions verify the solution quality, scalability and practicality of the framework.
منابع مشابه
Multi-team Facilitation of Very Largescale Distributed Meetings
Distributed work teams routinely use virtual meetings to support their collaborative work. In this paper, we present a case study of the facilitation that was provided for a very large-scale distributed meeting. Small teams of facilitators were recruited, trained, and assigned to each of six discussion forums of ManagerJam, a 48 hour meeting of over 8,000 managers in a large global technology c...
متن کاملDistributed Path Consensus Algorithm
Path planning methods based on discrete graph searches are highly effective in dealing with large-scale environments, arbitrarily shaped obstacles, frequent map updates and complex cost functions. When it comes to planning for teams of coupled agents however, typical graph searchbased planning techniques quickly become infeasible due to the exponential growth of the dimensionality of the joint ...
متن کاملAn Observational Study of a Distributed Card Based Planning Environment
Providing support for distributed agile teams as they conduct planning meetings is tricky. In distributed settings, the use of paper index cards for planning isn’t convenient, as some of the team members do not have access to the physical cards. We present the results of an observational study where a distributed card based planning tool was evaluated against planning with physical cards. The f...
متن کاملDistributed Continual Planning for Unmanned Ground Vehicle Teams
tance of distributed continual planning concepts; coordinating teams of unmanned ground vehicles in dynamic environments is an example of such a domain. In this article, I illustrate the ideas in, and promises of, distributed continual planning by showing how acquiring and distributing operator intent among multiple semiautonomous vehicles supports ongoing, cooperative mission elaboration and r...
متن کاملEnabling Flexible Human Strategic Guidance for Multi-Agent Planning and Scheduling in Dynamic Uncertain Domains
Creating decision support systems to help people coordinate in the real world is difficult because it requires simultaneously addressing planning, scheduling, uncertainty and distribution. Generic AI approaches produce inadequate solutions because they cannot leverage the structure of domains and the intuition that end-users have for solving particular problem instances. We present a general ap...
متن کامل